Dataset statistics
| Number of variables | 4 |
|---|---|
| Number of observations | 1247 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 38 |
| Duplicate rows (%) | 3.0% |
| Total size in memory | 388.0 KiB |
| Average record size in memory | 318.6 B |
Variable types
| Text | 3 |
|---|---|
| Categorical | 1 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 38 (3.0%) duplicate rows | Duplicates |
Reproduction
| Analysis started | 2024-02-06 17:05:58.144966 |
|---|---|
| Analysis finished | 2024-02-06 17:06:09.226954 |
| Duration | 11.08 seconds |
| Software version | ydata-profiling v4.6.4 |
| Download configuration | config.json |
Nome
Text
| Distinct | 870 |
|---|---|
| Distinct (%) | 69.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.1 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 17 |
| Mean length | 7.723336 |
| Min length | 3 |
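The length statistics can be recomputed directly from the raw strings. The following is a minimal sketch using only the Python standard library; it runs on the five sample `Nome` values quoted in this report rather than on the full column, and the helper name `length_stats` is hypothetical (it is not part of ydata-profiling).

```python
from statistics import mean, median


def length_stats(values):
    """Return max, median, mean and min string length for a list of strings."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


# Five sample rows of the Nome column, not the full 1247 observations.
# Their lengths are 9, 7, 8, 10 and 10 characters.
sample = ["Bulbasaur", "Ivysaur", "Venusaur", "Charmander", "Charmeleon"]
stats = length_stats(sample)
print(stats)
```

On the full column, the mean length is the total number of characters divided by the number of observations: 9631 / 1247 ≈ 7.7233.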
Characters and Unicode
| Total characters | 9631 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
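As an illustration of how such properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the input is two values quoted in this report, not the full dataset. The two-letter codes map onto the names used in the tables: `Lu` is Uppercase Letter, `Ll` is Lowercase Letter, `Po` is Other Punctuation, and `Zs` is Space Separator.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories (e.g. 'Lu', 'Ll') over all characters."""
    return Counter(unicodedata.category(ch) for v in values for ch in v)


# One Nome sample and one Tipo sample taken from this report.
counts = category_counts(["Bulbasaur", "Planta, Venenoso"])
for code, n in counts.most_common():
    print(code, n)
```

The standard `unicodedata` module exposes the general category but not the script or block of a character, so the script and block tables would need a different data source.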
Unique
| Unique | 554 |
|---|---|
| Unique (%) | 44.4% |
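The difference between the Distinct and Unique figures can be sketched as follows, assuming (as the counts suggest) that Distinct is the number of different values and Unique is the number of values that occur exactly once. The helper name `distinct_and_unique` and the toy list are hypothetical; only the names themselves come from this report.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Toy input: the repetition pattern is made up for illustration.
toy = ["Indeedee", "Indeedee", "Eiscue", "Bulbasaur", "Eiscue", "Ivysaur"]
distinct, unique = distinct_and_unique(toy)
print(distinct, unique)
```

Both percentages are taken relative to the 1247 observations: 870 / 1247 ≈ 69.8% and 554 / 1247 ≈ 44.4%.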
Sample
| 1st row | Bulbasaur |
|---|---|
| 2nd row | Ivysaur |
| 3rd row | Venusaur |
| 4th row | Charmander |
| 5th row | Charmeleon |
| Value | Count | Frequency (%) |
| galarian | 12 | 0.9% |
| indeedee | 7 | 0.6% |
| silicobra | 5 | 0.4% |
| sandaconda | 5 | 0.4% |
| darmanitan | 5 | 0.4% |
| morgrem | 4 | 0.3% |
| tapu | 4 | 0.3% |
| rapidash | 4 | 0.3% |
| corsola | 4 | 0.3% |
| eiscue | 4 | 0.3% |
| Other values (860) | 1214 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 918 | 9.5% |
| e | 805 | 8.4% |
| o | 769 | 8.0% |
| r | 734 | 7.6% |
| i | 687 | 7.1% |
| n | 590 | 6.1% |
| l | 568 | 5.9% |
| t | 444 | 4.6% |
| u | 388 | 4.0% |
| s | 322 | 3.3% |
| Other values (51) | 3406 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8325 | 86.4% |
| Uppercase Letter | 1270 | 13.2% |
| Space Separator | 21 | 0.2% |
| Other Punctuation | 7 | 0.1% |
| Dash Punctuation | 5 | 0.1% |
| Other Symbol | 2 | < 0.1% |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 918 | |
| e | 805 | 9.7% |
| o | 769 | 9.2% |
| r | 734 | 8.8% |
| i | 687 | 8.3% |
| n | 590 | 7.1% |
| l | 568 | 6.8% |
| t | 444 | 5.3% |
| u | 388 | 4.7% |
| s | 322 | 3.9% |
| Other values (17) | 2100 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 170 | |
| C | 106 | 8.3% |
| M | 91 | 7.2% |
| G | 88 | 6.9% |
| P | 86 | 6.8% |
| D | 81 | 6.4% |
| T | 77 | 6.1% |
| B | 67 | 5.3% |
| A | 59 | 4.6% |
| F | 54 | 4.3% |
| Other values (16) | 391 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| : | 2 | |
| ' | 2 |
Other Symbol
| Value | Count | Frequency (%) |
| ♂ | 1 | |
| ♀ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 21 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9595 | 99.6% |
| Common | 36 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 918 | 9.6% |
| e | 805 | 8.4% |
| o | 769 | 8.0% |
| r | 734 | 7.6% |
| i | 687 | 7.2% |
| n | 590 | 6.1% |
| l | 568 | 5.9% |
| t | 444 | 4.6% |
| u | 388 | 4.0% |
| s | 322 | 3.4% |
| Other values (43) | 3370 |
Common
| Value | Count | Frequency (%) |
| (space) | 21 | 58.3% |
| - | 5 | 13.9% |
| . | 3 | 8.3% |
| : | 2 | 5.6% |
| ' | 2 | 5.6% |
| ♂ | 1 | 2.8% |
| ♀ | 1 | 2.8% |
| 2 | 1 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9625 | 99.9% |
| None | 4 | < 0.1% |
| Misc Symbols | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 918 | 9.5% |
| e | 805 | 8.4% |
| o | 769 | 8.0% |
| r | 734 | 7.6% |
| i | 687 | 7.1% |
| n | 590 | 6.1% |
| l | 568 | 5.9% |
| t | 444 | 4.6% |
| u | 388 | 4.0% |
| s | 322 | 3.3% |
| Other values (48) | 3400 |
None
| Value | Count | Frequency (%) |
| é | 4 |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 1 | |
| ♀ | 1 |
Tipo
Text
| Distinct | 209 |
|---|---|
| Distinct (%) | 16.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 99.0 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 10.092221 |
| Min length | 3 |
Characters and Unicode
| Total characters | 12585 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 50 |
|---|---|
| Unique (%) | 4.0% |
Sample
| 1st row | Planta, Venenoso |
|---|---|
| 2nd row | Planta, Venenoso |
| 3rd row | Planta, Venenoso |
| 4th row | Fogo |
| 5th row | Fogo |
| Value | Count | Frequency (%) |
| água | 181 | 9.6% |
| normal | 150 | 8.0% |
| planta | 148 | 7.9% |
| voador | 136 | 7.2% |
| psíquico | 123 | 6.5% |
| inseto | 113 | 6.0% |
| fogo | 96 | 5.1% |
| fada | 91 | 4.8% |
| lutador | 89 | 4.7% |
| venenoso | 87 | 4.6% |
| Other values (15) | 671 |
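The word table above lists lowercased tokens rather than whole cell values. A minimal sketch of one way to obtain such counts is shown below; the tokenization (lowercase, split on whitespace, strip surrounding punctuation) is an assumption rather than a confirmed description of what ydata-profiling does, and the helper name `word_counts` is hypothetical. The input is the five sample `Tipo` rows from this report.

```python
import string
from collections import Counter


def word_counts(values):
    """Count lowercased, punctuation-stripped words across all values."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            word = token.strip(string.punctuation)
            if word:  # skip tokens that were punctuation only
                counts[word] += 1
    return counts


# The five sample rows of the Tipo column, not the full dataset.
sample = ["Planta, Venenoso"] * 3 + ["Fogo"] * 2
counts = word_counts(sample)
print(counts.most_common())
```

Because the split happens on whitespace, a value holding two types such as `Planta, Venenoso` contributes one count to each word, which is why the word counts can sum to more than the number of rows.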
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1587 | 12.6% |
| a | 1533 | 12.2% |
| r | 932 | 7.4% |
| e | 694 | 5.5% |
| t | 690 | 5.5% |
| , | 638 | 5.1% |
| (space) | 638 | 5.1% |
| n | 580 | 4.6% |
| l | 504 | 4.0% |
| s | 469 | 3.7% |
| Other values (28) | 4320 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9424 | 74.9% |
| Uppercase Letter | 1885 | 15.0% |
| Other Punctuation | 638 | 5.1% |
| Space Separator | 638 | 5.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1587 | |
| a | 1533 | |
| r | 932 | |
| e | 694 | |
| t | 690 | |
| n | 580 | 6.2% |
| l | 504 | 5.3% |
| s | 469 | 5.0% |
| u | 446 | 4.7% |
| d | 363 | 3.9% |
| Other values (11) | 1626 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 318 | |
| F | 266 | |
| V | 230 | |
| N | 202 | |
| Á | 181 | |
| I | 113 | 6.0% |
| T | 97 | 5.1% |
| L | 89 | 4.7% |
| E | 88 | 4.7% |
| D | 82 | 4.4% |
| Other values (5) | 219 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 638 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 638 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11309 | 89.9% |
| Common | 1276 | 10.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1587 | |
| a | 1533 | |
| r | 932 | 8.2% |
| e | 694 | 6.1% |
| t | 690 | 6.1% |
| n | 580 | 5.1% |
| l | 504 | 4.5% |
| s | 469 | 4.1% |
| u | 446 | 3.9% |
| d | 363 | 3.2% |
| Other values (26) | 3511 |
Common
| Value | Count | Frequency (%) |
| , | 638 | |
| (space) | 638 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12086 | 96.0% |
| None | 499 | 4.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1587 | |
| a | 1533 | |
| r | 932 | 7.7% |
| e | 694 | 5.7% |
| t | 690 | 5.7% |
| , | 638 | 5.3% |
| (space) | 638 | 5.3% |
| n | 580 | 4.8% |
| l | 504 | 4.2% |
| s | 469 | 3.9% |
| Other values (23) | 3821 |
None
| Value | Count | Frequency (%) |
| Á | 181 | |
| í | 123 | |
| é | 87 | |
| ã | 82 | |
| ç | 26 | 5.2% |
Habilidades
Text
| Distinct | 187 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 130.0 KiB |
Length
| Max length | 38 |
|---|---|
| Median length | 34 |
| Mean length | 24.806736 |
| Min length | 11 |
Characters and Unicode
| Total characters | 30934 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 36 |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | Raio Solar, Veneno Ácido |
|---|---|
| 2nd row | Raio Solar, Veneno Ácido |
| 3rd row | Raio Solar, Veneno Ácido |
| 4th row | Chama, Investida de Fogo |
| 5th row | Chama, Investida de Fogo |
| Value | Count | Frequency (%) |
| investida | 764 | 16.6% |
| de | 527 | 11.4% |
| raio | 370 | 8.0% |
| pedra | 225 | 4.9% |
| surf | 174 | 3.8% |
| confusão | 165 | 3.6% |
| solar | 119 | 2.6% |
| vento | 113 | 2.4% |
| cortante | 112 | 2.4% |
| psíquico | 106 | 2.3% |
| Other values (59) | 1940 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3368 | 10.9% |
| a | 2899 | 9.4% |
| o | 2802 | 9.1% |
| e | 2353 | 7.6% |
| i | 1931 | 6.2% |
| d | 1817 | 5.9% |
| n | 1539 | 5.0% |
| r | 1421 | 4.6% |
| t | 1292 | 4.2% |
| , | 1247 | 4.0% |
| Other values (41) | 10265 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22278 | 72.0% |
| Uppercase Letter | 4041 | 13.1% |
| Space Separator | 3368 | 10.9% |
| Other Punctuation | 1247 | 4.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2899 | |
| o | 2802 | |
| e | 2353 | |
| i | 1931 | |
| d | 1817 | |
| n | 1539 | 6.9% |
| r | 1421 | 6.4% |
| t | 1292 | 5.8% |
| s | 1216 | 5.5% |
| v | 814 | 3.7% |
| Other values (20) | 4194 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 764 | |
| S | 542 | |
| P | 470 | |
| C | 433 | |
| R | 370 | |
| V | 246 | 6.1% |
| D | 240 | 5.9% |
| Á | 164 | 4.1% |
| G | 149 | 3.7% |
| A | 140 | 3.5% |
| Other values (9) | 523 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3368 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1247 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26319 | 85.1% |
| Common | 4615 | 14.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2899 | 11.0% |
| o | 2802 | 10.6% |
| e | 2353 | 8.9% |
| i | 1931 | 7.3% |
| d | 1817 | 6.9% |
| n | 1539 | 5.8% |
| r | 1421 | 5.4% |
| t | 1292 | 4.9% |
| s | 1216 | 4.6% |
| v | 814 | 3.1% |
| Other values (39) | 8235 |
Common
| Value | Count | Frequency (%) |
| (space) | 3368 | 73.0% |
| , | 1247 | 27.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30235 | 97.7% |
| None | 699 | 2.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 3368 | 11.1% |
| a | 2899 | 9.6% |
| o | 2802 | 9.3% |
| e | 2353 | 7.8% |
| i | 1931 | 6.4% |
| d | 1817 | 6.0% |
| n | 1539 | 5.1% |
| r | 1421 | 4.7% |
| t | 1292 | 4.3% |
| , | 1247 | 4.1% |
| Other values (33) | 9566 |
None
| Value | Count | Frequency (%) |
| ã | 216 | |
| Á | 164 | |
| í | 106 | |
| â | 94 | |
| ô | 80 | 11.4% |
| ó | 25 | 3.6% |
| á | 11 | 1.6% |
| é | 3 | 0.4% |
Geração
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.2 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.4811548 |
| Min length | 5 |
Characters and Unicode
| Total characters | 8082 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 2 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Primeira |
|---|---|
| 2nd row | Primeira |
| 3rd row | Primeira |
| 4th row | Primeira |
| 5th row | Primeira |
Common Values
| Value | Count | Frequency (%) |
| Quarta | 432 | 34.6% |
| Quinta | 155 | 12.4% |
| Primeira | 151 | 12.1% |
| Terceira | 135 | 10.8% |
| Oitava | 113 | 9.1% |
| Segunda | 100 | 8.0% |
| Sétima | 89 | 7.1% |
| Sexta | 72 | 5.8% |
[Figure: Histogram of lengths of the category]
[Figure: Common Values (Plot)]
| Value | Count | Frequency (%) |
| quarta | 432 | 34.6% |
| quinta | 155 | 12.4% |
| primeira | 151 | 12.1% |
| terceira | 135 | 10.8% |
| oitava | 113 | 9.1% |
| segunda | 100 | 8.0% |
| sétima | 89 | 7.1% |
| sexta | 72 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1792 | |
| r | 1004 | |
| t | 861 | |
| i | 794 | |
| u | 687 | 8.5% |
| e | 593 | 7.3% |
| Q | 587 | 7.3% |
| S | 261 | 3.2% |
| n | 255 | 3.2% |
| m | 240 | 3.0% |
| Other values (9) | 1008 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6835 | 84.6% |
| Uppercase Letter | 1247 | 15.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1792 | |
| r | 1004 | |
| t | 861 | |
| i | 794 | |
| u | 687 | 10.1% |
| e | 593 | 8.7% |
| n | 255 | 3.7% |
| m | 240 | 3.5% |
| c | 135 | 2.0% |
| v | 113 | 1.7% |
| Other values (4) | 361 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Q | 587 | |
| S | 261 | |
| P | 151 | 12.1% |
| T | 135 | 10.8% |
| O | 113 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8082 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1792 | |
| r | 1004 | |
| t | 861 | |
| i | 794 | |
| u | 687 | 8.5% |
| e | 593 | 7.3% |
| Q | 587 | 7.3% |
| S | 261 | 3.2% |
| n | 255 | 3.2% |
| m | 240 | 3.0% |
| Other values (9) | 1008 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7993 | 98.9% |
| None | 89 | 1.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1792 | |
| r | 1004 | |
| t | 861 | |
| i | 794 | |
| u | 687 | 8.6% |
| e | 593 | 7.4% |
| Q | 587 | 7.3% |
| S | 261 | 3.3% |
| n | 255 | 3.2% |
| m | 240 | 3.0% |
| Other values (8) | 919 |
None
| Value | Count | Frequency (%) |
| é | 89 |
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
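The per-column counts behind a nullity view can be sketched in a few lines. The example below is a minimal standard-library illustration, not the report's own implementation: it treats `None` as the only missing marker (an assumption), the helper name `missing_per_column` is hypothetical, and the third row is an invented incomplete record added only so that the output is not all zeros. This report itself lists 0 missing cells.

```python
def missing_per_column(rows, columns):
    """Count cells that are None in each column of a list of row dicts."""
    return {col: sum(1 for row in rows if row.get(col) is None) for col in columns}


columns = ["Nome", "Tipo", "Habilidades", "Geração"]
rows = [
    {"Nome": "Bulbasaur", "Tipo": "Planta, Venenoso",
     "Habilidades": "Raio Solar, Veneno Ácido", "Geração": "Primeira"},
    {"Nome": "Charmander", "Tipo": "Fogo",
     "Habilidades": "Chama, Investida de Fogo", "Geração": "Primeira"},
    # Hypothetical incomplete row, not present in the dataset.
    {"Nome": "Example", "Tipo": None, "Habilidades": None, "Geração": "Primeira"},
]

missing = missing_per_column(rows, columns)
total_missing = sum(missing.values())
print(missing, total_missing)
```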
| | Nome | Tipo | Habilidades | Geração |
|---|---|---|---|---|
| 0 | Bulbasaur | Planta, Venenoso | Raio Solar, Veneno Ácido | Primeira |
| 1 | Ivysaur | Planta, Venenoso | Raio Solar, Veneno Ácido | Primeira |
| 2 | Venusaur | Planta, Venenoso | Raio Solar, Veneno Ácido | Primeira |
| 3 | Charmander | Fogo | Chama, Investida de Fogo | Primeira |
| 4 | Charmeleon | Fogo | Chama, Investida de Fogo | Primeira |
| 5 | Charizard | Fogo, Voador | Chama, Bico Perfurante | Primeira |
| 6 | Squirtle | Água | Surf, Jato de Água | Primeira |
| 7 | Wartortle | Água | Surf, Jato de Água | Primeira |
| 8 | Blastoise | Água | Surf, Jato de Água | Primeira |
| 9 | Caterpie | Inseto | Investida, Pó Venenoso | Primeira |
| | Nome | Tipo | Habilidades | Geração |
|---|---|---|---|---|
| 1237 | Morgrem | Noturno, Fada | Punho Sombrio, Desejo Misterioso | Oitava |
| 1238 | Perrserker | Aço | Asas de Ferro, Corte Feroz | Oitava |
| 1239 | Copperajah | Metal, Elétrico | Corte Feroz, Choque do Trovão | Oitava |
| 1240 | Falinks | Lutador | Soco Dinâmico, Investida | Oitava |
| 1241 | Pincurchin | Elétrico | Choque do Trovão, Esfera Aura | Oitava |
| 1242 | Cursola | Fantasma | Bola Sombria, Confusão | Oitava |
| 1243 | Runerigus | Fantasma, Terra | Bola Sombria, Terremoto | Oitava |
| 1244 | Stonjourner | Rocha | Pedra Afiada, Investida | Oitava |
| 1245 | Eiscue | Gelo | Raio de Gelo, Tumulto | Oitava |
| 1246 | Indeedee | Psíquico, Normal | Confusão, Desejo | Oitava |
Most frequently occurring
| | Nome | Tipo | Habilidades | Geração | # duplicates |
|---|---|---|---|---|---|
| 26 | Indeedee | Psíquico, Normal | Confusão, Desejo | Oitava | 6 |
| 0 | Appletun | Planta, Dragão | Raio Solar, Garra Dragônica | Oitava | 3 |
| 1 | Applin | Planta, Dragão | Raio Solar, Garra Dragônica | Oitava | 3 |
| 2 | Arrokuda | Água | Surf, Aqua Jet | Oitava | 3 |
| 3 | Boltent | Elétrico | Raio Veloz, Choque do Trovão | Oitava | 3 |
| 5 | Coalossal | Fogo, Rocha | Chama, Pedra Afiada | Oitava | 3 |
| 6 | Copperajah | Metal, Elétrico | Corte Feroz, Choque do Trovão | Oitava | 3 |
| 8 | Cramorant | Voador, Água | Surf, Bico Perfurante | Oitava | 3 |
| 9 | Cursola | Fantasma | Bola Sombria, Confusão | Oitava | 3 |
| 10 | Drednaw | Água, Rocha | Surf, Investida de Pedra | Oitava | 3 |
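A table like the one above can be approximated by counting identical rows. The sketch below assumes that `# duplicates` is the number of times an identical row occurs, which this report does not state explicitly. The helper name `duplicate_rows` is hypothetical, and while the row contents are taken from the report, the repetition counts in the toy data are made up.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return (row, count) pairs for rows seen more than once, most frequent first."""
    counts = Counter(rows)
    return [(row, n) for row, n in counts.most_common() if n > 1]


eiscue = ("Eiscue", "Gelo", "Raio de Gelo, Tumulto", "Oitava")
cursola = ("Cursola", "Fantasma", "Bola Sombria, Confusão", "Oitava")
falinks = ("Falinks", "Lutador", "Soco Dinâmico, Investida", "Oitava")

# Toy data: how often each row repeats here is illustrative only.
rows = [eiscue, cursola, eiscue, falinks, cursola, eiscue]
dups = duplicate_rows(rows)
for row, n in dups:
    print(row[0], n)
```

Rows are stored as tuples so that they are hashable and can be counted directly; a row that occurs only once, such as the `Falinks` entry in the toy data, is left out of the result.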